
Conversation

@xmfan xmfan (Member) commented Nov 8, 2025

Stacked PRs:


Mark input tokens to routed experts as dynamic to avoid a recompile

This saves one recompile, and you can see that the input tokens are already dynamic in the first compiled graph:

class GraphModule(torch.nn.Module):
    def forward(...s77: "Sym(s77)", L_x_: "bf16[s77, 5120][5120, 1]cuda:0"...

I verified that this also fixes the AC recompile issue in #1971, but I'm keeping torch._C._dynamo.eval_frame._set_lru_cache(False), as other recompile reasons could still pop up.
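
For illustration, a minimal sketch of the idea (not the actual torchtitan call site; the function and shapes are made up, only torch._dynamo.mark_dynamic is the real API): marking the token dimension dynamic up front keeps torch.compile from specializing on the first-seen token count and recompiling when it changes.

```python
import torch
import torch.nn.functional as F

# Sketch only: mark dim 0 of the tokens routed to an expert as dynamic so the
# first compiled graph already uses a symbolic size (e.g. "Sym(s77)").

@torch.compile
def expert_forward(x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    return F.silu(x @ w)

w = torch.randn(5120, 5120)
for num_tokens in (64, 96, 128):        # routed token count varies per step
    x = torch.randn(num_tokens, 5120)
    torch._dynamo.mark_dynamic(x, 0)    # dim 0 compiles as a SymInt
    expert_forward(x, w)                # no recompile after the first call
```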

@jquesnelle (Contributor) commented:
The fix for #1971 requires a PyTorch nightly from the last few days (_set_lru_cache was only just added), so this may be the preferable fix, since it allows staying on PyTorch stable (2.9).

@wwwjn wwwjn (Contributor) left a comment:

This change LGTM! nit: Can you explain more why we still need torch._C._dynamo.eval_frame._set_lru_cache(False) to fix #1971? If no specific reason, should we remove it?

@tianyu-l tianyu-l (Contributor) left a comment:

sgtm

@xmfan xmfan (Member, Author) commented Nov 17, 2025

> This change LGTM! nit: Can you explain more why we still need torch._C._dynamo.eval_frame._set_lru_cache(False) to fix #1971? If no specific reason, should we remove it?

There are many issues like #1971 that can pop up, and they may not have good error messages. Keeping torch._C._dynamo.eval_frame._set_lru_cache(False) in the codebase will protect against all of them.
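
As a hypothetical placement sketch (not a quote from the torchtitan source), the defensive setting could be guarded so it only runs on builds where the private API exists:

```python
import torch

# Hypothetical guard: _set_lru_cache only exists in recent nightlies,
# so skip the call on older PyTorch builds.
eval_frame = torch._C._dynamo.eval_frame
if hasattr(eval_frame, "_set_lru_cache"):
    eval_frame._set_lru_cache(False)
```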

